A NEW REM CONJECTURE 



G. BEN AROUS, VERONIQUE GAYRARD, AND A. KUPTSOV 

Abstract. We introduce here a new universality conjecture for levels of ran- 
dom Hamiltonians, in the same spirit as the local REM conjecture made by 
S. Mertens and H. Bauke. We establish our conjecture for a wide class of 
Gaussian and non-Gaussian Hamiltonians, which include the p-spin models, 
the Sherrington-Kirkpatrick model and the number partitioning problem. We 
prove that our universality result is optimal for the last two models by showing 
when this universality breaks down. 



1. Introduction 

S. Mertens and H. Bauke recently observed ( [MerOOj . [BM04] . see also [BFM04] ) 
that the statistics of energy levels for very general random Hamiltonians are Pois- 
sonian, when observed micro-canonically, i.e. in a small window in the bulk. They 
are universal and identical to those of the simplest spin-glass model, the Random 
Energy Model or REM, hence the name of this (numerical) observation: the REM 
conjecture or more precisely the local REM conjecture. 

This local REM conjecture was made for a wide class of random Hamiltonians 
of statistical mechanics of disordered systems, mainly spin-glasses (mean field or 
not), and for various combinatorial optimization problems (like number partition- 
ing). Recently, two groups of mathematicians have established this conjecture in 
different contexts : C. Borgs, J. Chayes, C.Nair, S. Mertens, B.Pittel for the num- 
ber partitioning question (see [BCPOlj . |BCMN05a] . |BCMN05bj ) . and A. Bovier, 
I. Kurkova for general spin- glass Hamiltonians (see |BK06aj . [BK06b]). 

We introduce here a new kind of universality for the energy levels of disordered 
systems. We believe that one should find universal statistics for the energy levels of 
a wide class of random Hamiltonians if one re-samples the energy levels, i.e. draws a 
random subset of these energies. Put otherwise, our conjecture is thus that the level 
statistics should also be universal, i.e Poissonian, when observed on a large random 
subset of the configuration space rather than in a micro-canonical window. We 
establish this new universality result (which could be called the re-sampling REM 
universality or the REM universality by dilution) for general (mean-field) spin-glass 
models, including the case of number partitioning and for large but sparse enough 
subsets. This approach has the following interesting property: the range of energies 
involved is not reduced to a small window as in the local REM conjecture. Thus 
we can study the extreme value distribution on the random subset, by normalizing 
the energies properly. Doing so we establish that the Gibbs measure restricted to 
a sparse enough random subset of configuration space has a universal distribution 
which is thus the same as for the REM, i.e. a Poisson-Dirichlet measure. 
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To be more specific, we specialize our setting to the case of random Hamiltonians 
(i?Ar(cr)) (T6 s 1 v defined on the hypercube Sn = {— 1,1}^. We want to consider a 
sparse random subset of the hypercube, say X, and the restriction of the function 
-Hjv to X. We introduce the random point process 

with a normalization: 

= H N (a)-a N 
on 

to be chosen. Our conjecture specializes to the following: the asymptotic behavior 
of the random point process Vn is universal, for a large class of Hamiltonians Hn, 
for appropriate sparse random subsets X, and appropriate normalization. 

We will only study here the simplest possible random subset, i.e. a site perco- 
lation cluster X = {a € Sn ■ X a = 1}, where the random variables (X^^Sn are 
i.i.d. and Bernoulli: 

2 M 

V{X a = l) = l-V(X a = Q)=: PN = ^ T . (1.3) 

Thus the mean size of X is 2 M and we will always assume that X is not too small, 
e.g. that logiV = o(2 M ). We will sometimes call X a random cloud. 

In order to understand what the universal behavior should be, let us examine the 
trivial case where the Hn(o~) are i.i.d centered standard Gaussian random variables, 
i.e. the case of the Random Energy Model. Then, standard extreme value theory 
proves that if 

a N = v / 2Mlog2 + 21og6jv - log 2 and b N = \{jj> ( 1A ) 
then Vn converges to a Poisson point process with intensity measure 

H(dt) = -4=e-*V^2(ft. (1.5) 

We will now fix the normalization needed in (|1.2[) by choosing aN and bN as in (jl.4p . 
The basic mechanism of the REM universality we propose is that the influence of 
correlations between the random variables H' N (a) should be negligible when the 
two-point correlation (the covariance) is a decreasing function of the Hamming 
distance 

d H (a,a')=#{i<N -.ai^o-i} (1.6) 

and when the random cloud is sparse enough. We first establish this universality 
conjecture for a large class of Gaussian Hamiltonians. This class contains the 
Sherrington-Kirkpatrick (SK) model as well as the more general p-spin models. It 
also contains the Gaussian version of the number partitioning problem. 

Consider a Gaussian Hamiltonian Hn on the hypercube Sn = {— 1, 1}^ such 
that the random variables (777\r(a')) (T g5 JV are centered and whose covariance is a 
smooth decreasing function of the Hamming distance or, equivalently a smooth 

N 

increasing function v of the overlap R(a, a') — jt a^a^ : 



N 
i=l 



cov(H N (a),H N (a')) = v{R{a,a')) = v(l - Mg ^ ) ), (1.7) 
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We will always assume that ^(0) = and that v(\) = 1. The first assumption is 
crucial, since it means that the correlation of the Hamiltonian vanishes for pairs of 
points on the hypercube which are at a typical distance. The second assumption 
simply normalizes the variance of H^{a) to 1. 

This type of covariance structure can easily be realized for v real analytic, of the 
form: 

p>i 

Indeed such a covariance structure can be realized by taking mixtures of p-spin 
models. Let -ff/v.p be the Hamiltonian of the p-spin model given by 

H N ,p( a ) = N (p-l)/2 ^2 5ii,i 2 ,...,i P cr H cr i2 ■ ■ -<Ti p , (I- 9 ) 

l<il ,22 ,...,i p <N 

where random variables (9i 1 ,i a ,...,i p )i<i 1 ,ia,...,i p <N are independent standard Gaus- 
sians defined on a common probability space (f2 9 , J- 9 ,P). Then 

H N (a) = -^=J2 a P H "A<r) (1-10) 



N p>i 

has the covariance structure given in (|1.7p - (|1.8[) Let us recall that the case where 
v(r) — r is the Gaussian version of the number partitioning problem ( B CPOl"] ). 
the case where v(r) = r 2 is the SK model, and more generally, v{r) — r p defines 
the pure p-spin model. 
Let us normalize Hm as above (see (| 1 .4[) ) : 

H n(p) = 7 , (1-11) 

on 

and consider the sequence of point processes 

Theorem 1.1 (Universality in the Gaussian case). Assume that M = o{s/N) if 
f'(0) 7^ 0, and that M = o(N) if = 0. Then, P-almost surely, the distribution 
of the point process Vn converges weakly to the distribution of a Poisson point 
process V on K with intensity given by 

H{dt) = -^e-tV^^dt. (1.13) 

Remark 1.2. The condition logiV = o(2 M ) is needed in order to get P-almost sure 
results. 

We extend this result, in Section^ to a wide class of non-Gaussian Hamiltonians 
(introduced in |BCMN05aj for the case of number partitioning). 
The theorem has the following immediate corollary. Let us fix the realization of 
the random cloud X. For configurations a belonging to the cloud we consider the 
Gibbs' weights Gn,/}^) of the re-scaled Hamiltonian H' N (a) 

e -PH' N (a) e -P^MH N (a) 
G N g(a) = 3T77-TT = 7= • (1-14) 

,PK ' £ e -0H' N { 8 ) e -pVMH N ( ) y 1 

g£X geX 
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Reordering the Gibbs' weights Gjv,/3(c) of the configurations a G X as a non- 
increasing sequence (w a ) a <\x\ an d defining w a = for a > \X\ we get a random 
element w of the space S of non increasing sequences of non negative real numbers 
with sum less than one. 

Corollary 1.3 (Convergence to Poisson-Dirichlet). If (3 > v / 21og2 then P-almost 
surely under the assumptions of Theorem ] 1 . 1\ the law of the sequence w — (w a ) a >i 

converges to the Poisson-Dirichlet distribution with parameter m — ^ 21 ° g 2 n S. 

The fact that Theorem 11.11 implies Corollary 11.31 is well-known, see for instance 
[Tal03] (pp. 13-19) for a good exposition. 

It is then a natural question to know if our sparseness assumption is optimal. 
When the random cloud is denser can this universality survive? We show that our 
sparseness condition is indeed optimal for the number partitioning problem and for 
the SK model, and that the universality does break down. 

Theorem 1.4. [Breakdown of Universality for the number partitioning problem] 

(i) Let v(r) = r. Suppose that limsup A '^P < oo. Then P-almost surely, the 
distribution of the point process Vn converges to the distribution of a Poisson point 
process if and only if M = o(v^V). 

[Breakdown of Universality for the Sherrington- Kirkpatrick model] 

(ii) Let v(r) — r 2 . Suppose that limsup M ^ < § t * 2 ■ Then P-almost surely, the 
distribution of the point process Vn converges to the distribution of a Poisson point 
process if and only if M = o(N). 

We prove this theorem in Section 0] by showing that the second factorial moment 
of the point process does not converge to the proper value. The case of pure p-spin 
models, with p > 3, or more generally the case where ^'(0) = i/'(0) = differs 
strongly (see Theorem I4.7[) . The asymptotic behavior of the first three moments 
is compatible with a Poissonian convergence. Proving or disproving Poissonian 
convergence (or REM universality) in this case is still open, as it is for the local 
REM conjecture. 

The paper is organized as follows. In Section [2] we establish important combina- 
torial estimates about maximal overlaps of ^-tuples of points on the random cloud. 
We give a particular care to the case of pairs (I = 2) and triples [I = 3) which 
are important for the breakdown of universality results. In Section [3] we establish 
the universality in the Gaussian case (Theorem ll.l|) . We then prove, in Section 2J 
the breakdown of universality given in Theorem 11.41 Finally we extend the former 
results to a wide non-Gaussian setting in Section [5l 



2. Combinatorial estimates 
In this section we fix an integer I > 1 and study the maximal overlap 

R m ^(a\...,a e )= max \R(a\^)\. (2.1) 

l<i<j<£ 

For fixed TV £ N and R € [0, 1) let us define the following subsets of 

U t (R) = {{<t\ ...,a e ): R max (a\ ...,a e )< R} (2.2) 
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and 

V t (R) = {{a\ ...,a e ): R max (a\ ...,a e )= R}. (2.3) 

More generally, let the sequence Rn G [0, 1) be given (respectively, the correspond- 
ing sequence of Hamming distances djv = -y (1 — Rn)) and introduce the sequence 
of sets Ui(Rn) and Vi{Rn) which we denote for simplicity of notation by Un,h and 
Vn,i respectively. 

For a set Y C S N we denote by Y x its intersection with X 1 . In the following 
theorem we study the properties of the sets 

Ujfj = \ • • • , o*) G X 1 : i? max ( ( r 1 , • ■ • , <J £ ) < Rn} (2.4) 

and 

Vni = {(°\ ■■■,a t )eX l : R max (a\ ...,a e ) = R N }. (2.5) 
In order to state the main result of this section (Theorem l2.1[) we define the function 

J( X ) = { + log(l +x) if x G [-1, 1], (2 6) 

[ -(-oo otherwise. 

Theorem 2.1. Lei i/ie sequence Rn be such that NRj^ — ► oo. 
(i) TVien V-almost surely 

|C/^|=E|C/^|(1 + (1)). (2.7) 

fnj //-Rjv = o(l) and M(iV) > logJV then there exists a £ [0,1) and C > 0, 
depending only on £, such that Y-almost surely 

\Vfa\ <Ce aNJ ^E\V^ t \. (2.8) 

Proof. The proof is based on standard inequalities for i.i.d. random variables that 
result from exponential Chebychev inequality. We formulate them without proof: 
let (Xi)i<i< n be i.i.d. Bernoulli rv's with F{X t = 1) = 1 - F(X t = 0) = p and let 

n 

z = Y^ X i- Then > for 4 > °> 

i=l 

¥(Z -EZ> m.Z) < e -< p{1+t) log(i+t)+(i-p(i+ t )) log l=g±H) f (2 g) 

P(Z - EZ < -£EZ) < e^ 1 "*) iog(i-t)+(i-p(i-t)) log i=g=H) (2 1Q) 

If p = — > and t = t(n) — > as n — > oo, the above inequalities imply that, for 
large enough n, 

P(|Z-EZ| > <EZ) < 2e^ np * 2/4 , (2.11) 

whereas if p(n) — ► 0,t(n) — > oo, and p(n)t(n) — > we get from (|2.9p that for large 
enough n 

P(Z - EZ > <EZ) < e -5™p(!+*) iog(i+«) _ (2.12) 



The proof of Theorem 12.11 relies on the following elementary lemma that again 
we state without proof. 

Lemma 2.2. (i) For any sequence Rn € [0, 1) 

\U N/ \>2 m (l-2(^\e-* NR »Y (2.13) 
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(ii) Suppose Rn satisfies NR 2 ^ — * oo and Rn = o(l). Then for some C > 
depending only on £, 

\V N j\ = 2 Nl -^=e- NJ ^\l + o(l)). (2.14) 
v N 



As an elementary consequence of part (i) of Lemma |2"T21 one can prove that: 



Corollary 2.3. P-a.s. max \R(a,a')\ < S N , where S N = ±J M{N) N lo&2 + ]£ jr L . 

The proof of part (i) of Theorem l2.1l thcn proceeds as follows. Let us first express 
the size of the random cloud \X\ as a sum of i.i.d. random variables 

i*i = J2 ^=i- ( 2 - 15 ) 

Using (|2.11[) and the assumption that log N — o(2 M ), we see that P-almost surely 
|*| is given by its expected value, i.e. \X\ = 2 M (1 + o(l)). Therefore \X e \ = 
2 M «(l + o(l)). 

Since C X e and E|C/^| = jf N \U N ,t\ = 2 Mf {l + o(l)), then proving part (i) 
of Theorem 12. II is equivalent to proving that the set e coincides, up to an error 
of magnitude o(2 Me ), with the set X 1 . Let us rewrite e as 

U^= f| ( u N*)ji', (2-16) 

l<j<j'<£ 

where we defined 

(U§) jf = {(a 1 , . . . , a 1 ) e X? : \R&,o*')\ <R N }. (2.17) 

If we prove that every set (t/jy e )jj> coincides, up to an error of order o(2 Me ), with 
the set X , then the representation (|2.16|) implies part (i) of the theorem. We 
therefore concentrate on proving that 

\iPh)ii' I = E l(^v)*v|(l + (l)), P-a.s. (2.18) 
Without loss of generality we can consider the case of j = 1 and f = 2. By definition 



1(^)121 = ( £ 1x^=11^=1 j ( E 1 



,- 2 



\Flia 1 ,a 2 )\<R 



-2 



= E ^1=1 E E !*.=i ■ (2 - 19) 

As we already noted, the sum in the second factor of (|2.19j) concentrates on its 
expected value and is equal to 2 M( ^~ 2 ) (1 + o(l)). Let us thus turn to the first factor 
in (12+91) . 

Introduce the set {U$ 4 ) a i = {a 2 € X : |-R(ct\ a 2 )\ < R N }- Then 

k^vi= E 1 ^=i ( 2 - 2 °) 

<t 2 :|,R(<tV 2 )I<-Rn 
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The summands in this sum are i.i.d. and it follows from part (i) of Lemma 12.21 
that their number is at least 2^(1 - 2e~i NR ^) = 2 N (I + o(l)). Applying ([2~TT]) 
together with the assumption that log TV = o(2 M ) we obtain from Borel-Cantelli 
Lemma that P-a.s., for any a 1 € Sn, 

|(f/^) ffI | = (l + (l))E|(C/^) al |. (2.21) 

From (|2~Tgi) . (j!T2T)]) and (j2~2Tj) we immediately conclude that 

\(U%, e )i2\ = (1 + o(l))E\(U*z) 12 \, P-a.s. (2.22) 

This finishes the proof of part (i) of Theorem 12. II 

The proof of part (ii) is quite similar to the proof of part (i). By definition of 

V£ e we get 

Vg A Q |J (V£ t ) jf , (2.23) 
i<j<j'<e 

where 

(V&w = {(<r\ ...,a £ )eV^ e : \R^,^')\ = R N ). (2.24) 

We claim that it suffices to prove that P-almost surely 

\Wfo)jA<e^ RN M<yN*)ji'Y (2-25) 
Indeed, from (|2.23p and from the above inequality we obtain that 

\Vle\< E K V N,t)jA< E e aNJ{RN) n{Vli)jy\- (2-26) 
i<j<j><£ i<j<j'<e 

Using part (i) of Lemma [2~2l it is easy to establish that for all 1 < j < j' < I 

1^1 = 1(^)^1(1 + 0(1)), (2.27) 

and therefore 

E|V^ £ |=E|(y^) jy |(l + (l)). (2.28) 

Since (|2.26p and (|2.28p imply the result we concentrate on the proof of (|2.25p . 
Without loss of generality we can take j = 1 and j' = 2. Then, by definition of 




(2.29) 



As in the proof of part (i) we see that the second part of (|2 . 29[) concentrates on its 
expected value and equals to 2 M ^ -2 )(1 + o(l)). We are thus left to treat the first 
part. Introducing the set 

(VfoV — W 2 e X '■ \R(<r\° 2 )\ = Rn} (2.30) 

it is clear that 

\(V^\= E (2.31) 

<t 3 :\R(<t 1 ,<t'>)\=Rn 
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There are 2( d ) i.i.d. terms in the above sum. Applying (|2.12|) with t + 1 = 
e aNj{R N ) wnere a g [o j 1) w iH De chosen later, we obtain 

H\(y£ t )*i\ > e aNJ ^E\(V^)^\) < e -i E l^)^l( 1 + t ) 1 °g( 1 + t ). (2.32) 
From Stirling's formula we see that for some C > 0, 

n(V^\=2 PN (J) =2 M -^ e -^-)(l + (l)), (2.33) 



and thus the exponent in (|2.32|) is 

\n{Vl^ |(1 + t) log(l +t) = -^= 2 M e~^ N ^ aNJ(R N )(l + (1)) 



= ^ e Mlog2-(l-a)Ar i 7( Rw )-i log Nxj^q + q(1)) _ (2 34) 

If for every c 1 G Sat the set (Vjv^)a- 1 is empty then there is nothing to prove. 



Otherwise, by Corollary 12.31 we obtain that P-almost surely Rn < Sn, and since 
J{x) = \x 2 {\ + 0(x 2 )) near the origin we can choose a G [0, 1) in such a way that 

M log 2 — (1 - a)NJ(R N ) - - \ogN > 7 log TV (2.35) 

for some 7 > 0. Hence £ e -2 E K^ CT il(i+*)l°g(i+*) < OCj an d we obtain from (|2~32l) 
and Borel-Cantelli Lemma that P-almost surely 

K^.^e^'EK^)^!. (2-36) 

It is easy to see that (|2.36p and (|2.29[) imply (|2.25[) . This concludes the proof of 
Theorem O □ 

In part (ii) of Theorem 12.11 we studied the properties of the sets Vj$ t C X for 



arbitrary I > 2. For f = 2 we can improve on Theorem 12. II 

Theorem 2.4. Suppose that limsup t£ < 1. 
(%) If for some c\ > \ 

* ^ - ^ («') 

i/ien P '-almost surely 

lV£ 3 | = (l+o(l))E|y£ a |. (2.38) 
(2?) If for positive constants Ci,C2 

Mlog2 dlogTV Aflog2 calogJV 

— ^ - ^ (i?Ar) - ^v - + ^v - (2 - 39) 

t/ien </iere is a constant c such that P-almost surely 

IV*,! < iV c E|V# 2 |. (2.40) 



(m,) If for some C2 > 3 



2 



W>^ t ^ (2.41) 



t/ien i/ie sei V^ 2 is P-almost surely empty. 
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Proof, (i) By definition of V^ 2 > 

c^eSjv ff 2 eSjv:|fl(<rl, o- 2 )| = -Rjv 

Using (|2.30p the inner sum is |(V^ 2 )<r 1 l- Since it is a sum of i.i.d. random variables 
then for all t = t(N) = o(l) we get from (POT]) that 

P(ll(^ 2 ) CTl |-E|(^ 2 ) CTl ||>tE|(V^ 2 ) CTl |) <2e- t2 ^ v ^/\ (2.43) 
Using Stirling's approximation 

V2^n n+ 5 e -"+™ < n \ < V2^n n+ 5e-"+T^ (2.44) 

we obtain that 

Hm&V|-*»(J-0(-^j= w -). (2.45) 

Further, from (|2"U?j) and Corollary [23] we obtain that E|(V^ 2 ) CT i| > CW^ 1 / 2 for 
some positive constant C > 0. Choosing i = 7V~ 7 for some small enough 7 > in 
(|2.43| , we conclude from Borel-Cantelli Lemma that 

|(V^ 2 ) CT i| =E|(V^ 2 ) CTl |(1 + o(l)), P-a.s. (2.46) 

Using (|2Tl2"|) . (|2Tl6l) . and the fact that P-almost surely there are 2 M (1 + o(l)) 
configurations in the random cloud X proves (i). 

(ii) The proof is similar to the proof of part (i). In particular, from the repre- 
sentation (|2.42|) it is easy to see that it suffices to prove that 

KV&W < N c E\(V* 2 )„i\, P-a.s. (2.47) 

Choosing 1 + 1 = N c we get from (|2.45j) that for some positive constant C 

^l(^- 2 )^l(l + *) log(l +t)> CN^- 1 ' 2 log N. (2.48) 

Choosing c large enough and applying (|2.12[) together with Borel-Cantelli Lemma 
proves part (ii). 

(Hi) We again use the representation (|2.42p . Clearly it is enough to prove that 
for all a 1 £ X the set (V* 2 ) a i is P-almost surely empty. By (|2.45|) E|(Vjy < 
CN~ C2 ~ X / 2 for some positive constant C. Thus, choosing 1 + 1 = ,.\ . — , — > 00 
we obtain from (f27T2|) 

P(l(V^ 2 ) CTl | > 1) =P(|(Vj£ a VI > (l + t)E|(U^ 2 ) CTl |) < (t+1)- 1 / 2 . (2.49) 
By definition of i and by condition (|2.41|) we get 

(t + I)- 1 ' 2 < c l ' 2 N- c -l 2 - l l\ (2.50) 
Applying Borel-Cantelli Lemma proves (Hi). □ 

In order to estimate the third moment in Theorems 14. 1[ 14. 2\ and 14.71 we give a 
result similar to Theorem 12.41 but for £ = 3, i.e. for a given sequence of vectors 
(R^ 2 i ^23) R31) we estimate the cardinal of the set 

= {(^W 3 ) e X 3 : i?(aV 2 ) = < 2 ,i?( ( r 2 , ( T 3 ) = i^.i^V 1 ) - 

(2.51) 
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Below we omit the explicit dependence of the sequence (i?^, R23, R31) on N and 
instead of R\2,R 2 3 an d ^31 will write Ri2,R23 and R31 respectively. In order to 
formulate the theorem we introduce the following function on IR 3 : 

J l2 > (x, y, z) = log(l + x + y + z) + — - log(l + x - y - z) 

1 — x + y — z . . 1 — x — y + , 
H ^ log(l - x + y - z) H log(l - a; - y + z) 

(2.52) 

if |1 + x\ > \y + z\ and |1 — x\ > \y — z\ and j( 2 >(x,y, z) = +00 otherwise. 

Theorem 2.5. Suppose limsup-^ < 1. 

(i) If for some c\ > | £/ie sequence R12 satisfies (|2.37p , and if /or some > 1 

«w x Mlog2 , s cflogiV 

.7 (2) (^i2, i? 23 , i?si) < — + ^(iJia) - 1 N , (2.53) 

t/ien ¥ -almost surely 

|W^3| = (1+ (1))E|W^ 3 |. (2.54) 
(m) If for positive constants c^\c^\ 



^ + J(flu) _££^< jW(Rl2|ii3s , Rai) 



iV v w AT 

Mlog2 4 2) log^ 
< — ^ — + J{Ri2) + jj , (2.55) 

then there is a constant c such that P-almost surely 

\W* 3 \ < N C E\W* 3 \. (2.56) 

(Hi) If for some > § 

(Ri2,R 2 3, Rsi) > + J(Ri 2 ) + ^^1, (2.57) 

then P-almost surely the set W£ 3 is empty. 



Proof. ^From Lemma f2 .61 stated below it follows that, for arbitrary configurations 
a 1 , a 2 , a 3 S Sn, the function J^(R\2, R23, R31) is well defined. This lemma, 
whose proof we will omit, is a direct consequence of the fact that the Hamming 
distance on Sn satisfies triangle inequality. 

Lemma 2.6. For arbitrary configurations er 1 , a 2 and a 3 £ Sn we have 1 + R12 > 
\R 2 3 + Rail and 1 - R 12 > \R 23 —-Rail- 



The proof of the Theorem 12.51 is similar to that of Theorem 12.41 We begin by 
iting the size of the set W$ 3 as 

W,al= E E 1 ^»=i E ^.3=1 (2-58) 

u'eSiv <r 2 eSjv: <T 3 eSjv: 



Let us first estimate the number of terms in the last sum. This means that, given 
er 1 , a 2 £ Sn with overlap R(a 1 ,cr 2 ) = R 12 , we have to calculate the number of con- 
figurations er 3 with R(a 2 ,a 3 ) = R23 and R(a 3 , a 1 ) = R31. Without loss of generality 
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we can assume that all the spins of cr 1 are equal to 1. Further, let C(a 1 , a 2 , a 3 ) be 
a 3 x N matrix with rows a 1 , a 2 , a 3 . For a column vector 6 € { — 1, l} 3 we let ng 
be the number of columns of the matrix C that are equal to 5, i.e. 

n S = \{j<N: (a], a 2 , a 3 ) = S}\. (2.59) 

Then the overlaps can be written in terms of ng, namely 

"-(1,14) + "(1,1-1) - "(i,-i,i) ~ "(l-i-i) = NR 12 , 
"(1,1,1) _ "(1,1,-1) - "(1,-1,1) + "(1,-1,-1) = (2 fin) 

"(1,1,1) - "(1,1 -1) + "(1,-1,1) ~ "(1,-1 -1) = N ^31: 

«(i,i,i) + "(i,i,-i) + "(1,-1,1) + "(i,-i,-i) = Ab- 
solving this system of linear equations we find 

"(i.M) = 7N(1 + R 12 + R 23 +R 31 ), 

"(i,i,-i) = jN(l + R 12 - R 2 3 - R31), (9 p,i\ 

»(i,-i,i) = jrN(l-R 12 -R a3 +R 31 ), [Z - G1) 

"(1,-1,-1) = \N(1- R 12 + R 23 -R 31 ). 

We notice that specifying the configuration a 3 is equivalent to specifying the num- 
bers "(1,1,1), "(1,1,-1), "(1,-1,1), and ni ,-i,_i. Therefore the number of configura- 
tions a 3 € Sn with overlaps R(u 2 , a 3 ) — R23, and R(a 3 , a 1 ) — R31 is 

'"(1,1,1) + "(M-iA ( "(1,-1,1) + "(1,-1,-1) 

"(1,1,1) / V "(1,-1,1) 

("(1,1,1) +"(1,1,-1))! ("(!,_!,!) +"(1-1,-1))! 



"(1,1,1)! "(i,i,-i) ! "(1 -1,1)! "(1 -1,-1)! 



(2.62) 



Applying Stirling's approximation to (|2.62D one obtains that the number of terms 
in the last summation in (|2.58[) is of order 



2N e NJ(R 12 )-NJ^>(R 12 M 23 ,R 31 )^/ l R 2 
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Ny/P(R 12 ,R23,R3l) 

where P{x, y, z) = (1 + x + y + z)(l + x — y — z)(l — x + y — z)(l — x — y + z). The 
rest of the proof essentially is a rerun of the proof of Theorem 12.41 We skip the 
details. □ 



3. Proof of Theorem 11.11 

As in |BK06a| and |BCMN05a| . [BCMN05b] . the proof of the Poisson convergence 
is based on the analysis of factorial moments of the point processes Vn defined in 

dm. 

In general, let £at be a sequence of point processes defined on a common prob- 
ability space (fl, Q) and let £ be a Poisson point process with intensity mea- 
sure [i. Define the £ th factorial moment K(Z)i of the random variable Z to be 
EZ(Z — 1) . . . (Z — £ + 1). The following is a classical lemma that is a direct conse- 
quence of Theorem 4.7 in |Kal83j . 

Lemma 3.1. If for every I > 1 and every Borel set A 

lim E Q (£ N (A)) e - (^(A)) e , (3.1) 

N— >oo 

then the distribution of (£,n)n>i converges weakly to the distribution of £. 
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Applying Lemma 13.11 to the sequence of point processes Vn — M ^ ae 

following result proves Theorem ll.il 

Theorem 3.2. Under the assumptions of Theorem ] 1. 11 for every £ e N and every 
bounded Borel set A 

lim E(V N (A))i = (fj,(A)) e , P-a.s., (3.2) 

N— too 

where (i is defined in (|1.13| . 

Proof. We start with the computation of the first moment of Vn{A) : 

EV N (A) = J2 ¥ (HM£A). (3.3) 

As we saw in the proof of Theorem 12.11 the size of the random cloud |A| is P- 
almost surely 2 M (l + o(l)). Since H' N (a), a € Sn, are identically distributed normal 
random variables with mean —a^/b^ and variance 1/fr/v, the sum in (|3.3p can be 
written as 

2 m C ^ bN f e- x2b » /2 - aNbNX dx(l + o(l)), P-a.s. (3.4) 

A 

By the dominated convergence theorem and the definition of a^r it follows from 
(13. 4p that the limit of the first moment is /J,(A). 

To calculate factorial moments of higher order we follow BCMN05aj and rewrite 
the £ th factorial moment of Vn {A) as 

E(V N (A)) e = PWeA-,^)^) (3.5) 

where the sum runs over all ordered sequences of distinct configurations (er 1 , . . . , o~ ) S 
X e . To analyze it we decompose the set S N into three non- intersecting subsets 

S N = U e {R N ) U (U e (6 N )\U e {R N )) U (U e (5 N )) c , (3.6) 

where Ut is defined in (|2.2p . 8n is defined in Corollarv l2.3l and the sequence Rn is 
chosen such that: 

R N 0, Mv{R N ) -> 0, NR% oo. (3.7) 

This is possible since we have assumed that M — o(^/N) if v'{0) ^ and M = o(N) 
if f'(0) = 0. We recall here that the function v is defined in (JT77J) - (jTTHJ) . 

Having specified Rn, let us analyze the contribution to the sum (|3.5|) coming 
from the intersection of X e with the sets Ui(Rn),Ui(5n)\U£(Rn), and (Ui(5n)) c - 
Firstly, by Corollary 12.31 the intersection of the set (U(,(8n)) c with X e is P-a.s. 
empty and therefore its contribution to the sum (|3.5p is zero. Next, let us show 
that P-a.s. 

lim J2 ®{H' n {ct 1 )£A,...,H' n {ct*)&A)={ijl{A))\ (3.8) 

a^,...,a l 

where the sum is over all the sequences (a 1 , . . . , a 1 ) g Ui(Rn) H X 1 . 

For every I S N let Bio 1 , . . . , er £ ) denote the covariance matrix of random vari- 
ables H N (a 1 ), . . . ,H N (a e ). By (|1.7[) its elements, fry, are given by 

fry = i>(J2y) (3.9) 
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where we wrote R{a l ,<ji) = Rij. Since Rn = o(l) the matrix B(a 1 , . . . ,<r e ) is 
non-degenerate. We therefore get for (a 1 , . . . , a ) £ Ut(Rjsr)n X that 

F(H' N (a l )eA,... ) H' N (* e )€A) = 



x 

A 



e -(x,B- 1 x)b 2 N /2-a N b N (x,B- 1 l)-a 2 N (T,B- 1 l)/2 d £^ (3 jqN 



where B = B(a 1 , . . . , <r e ), x — (xi, . . . , xif and 1 = (1, . . . , 1)*. From the definition 
of oat we further get from (|3.10[) that 

i (<!-(r,B- 1 r))(Mlog2-l/21ogAf) 

PiKia 1 ) 6 4,..., H' N {o l ) eA) = ^== 

1 f f e ~(x,B- 1 x)b 2 N /2-a N b N (x,B- 1 T) d g_ /gj-g 



(V^Y 

A A 

Since the matrix B~ x is positive definite and since a^btq — > \J2 log 2 we con- 
clude from the dominated convergence theorem that for all bounded Borel sets A, 
uniformly in {a 1 , . . . , a 1 ) G U e (R N ) n X £ , 

1 f I e -(x,B- 1 x)b 2 N /2-a N b N {x,B- 1 l) d g 

~^ 21os2 ^dx = (fi(A)) e . (3.12) 



(V^Y 



e 



A A 

To evaluate e (Mf.B _1 f))(Miog2-i/2iogA/) in we i ookat (£— (1, -B _1 l)) det_B 

as a multivariate function of (i)jj)i<i<j<^- It is a polynomial of degree £ with co- 
efficients depending only on I and without constant term. It implies that \£ — 
(l,B _1 f)| = 0(v(R N )) and therefore, by |3~7)) . 

e (£-(l,B- 1 l))(M log 2-1/2 log M) = j_ + /jy ,3 jg) 

Combining (|3.1ip . (|3.12[) . and (|3.13|) we may rewrite the sum (|3 . 8|) as 

E i (1 + o(D) - (P(A)Y (1 + o(D). (3.14) 

Now it follows from Theorem O (i) and Lemma O (i) that \U e (R N ) n X £ | 
concentrates around its expected value, namely 2 M£ (1 + o(l)), so that (|3.14|) im- 
plies (j3~8|) . 

Next, let us establish that the contribution from the second set in (I3.6[) is negli- 
gible, i.e. let us prove that 

lim Y P (H'^a 1 ) G A, . . . , fl^(o^) G A) = 0, (3.15) 

o-i,...,<r« 

where the sum runs over all the sequences (a 1 , . . . , a 1 ) G {Ui{8n)\Ui{Rn)) n 
To do this we first bound the righthand side of (|3.1ip . By definition (|2.ip and 
Corollary 12.31 i? m ax(o' 1 , • ■ ■ , c ) < <5jv = o(l). Therefore following the same rea- 
soning as above we obtain from (|3.11jl that for some constant C > and for all 
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{a\...,a l ) e (Ut(5 N )\U t (RN))r\X e 

CM!/(iJ max ) 

FiH'ni* 1 ) £ A,..., H' N (*«) e A) < — ^ {^A)f{l + o(l)). (3.16) 

For fixed N the overlap takes only a discrete set of values 

2k 



K 



N 



|l-_:A = 0,l,. ..,Nj. (3.17) 

We represent the set [/^ (5jv)\[/^(i?jv) as a union of sets Ve(RN,k), where we denoted 
RN,k — 1 — 7^ € iCiV H (-Rat, <5 at). Let us fix k and bound the contribution from the 
set V((R N , k )nX e , i.e. 

PK^eA.-.^eA). (3.18) 

We obtain from Lemma \2~2l (ii) that 

\VAR N ,k)\ = 2 ffl -^e- NJ ( fl ^)(l + o(l)). (3.19) 

V -/V 



In the case M(N) < log A" we can choose the sequence Rn in such a way that the 
set Ue(6N)\Ue(RN) is empty. Therefore we can assume without loss of generality 
that M(N) > log AT. Applying part (ii) of Theorem 12. II we further conclude that 



\V t (R N ,k) n < 2^4Le-C 1 - < W(- R ".*) ) P-a.s. (3.20) 
v A" 



Using (|3. 16|) and the last inequality we bound the sum (|3. 18|) by 

C _ e -(l-a)NJ{R Nik ) e CMv(R N , k ) /g 21) 

N 



One can easily check that Mv(Rm,h) = o(NRj^ k ) for R^,k & (Rn,8n)- Together 



L N,k/ 

with S(x) > x 2 /2 it implies that for some positive constants C\ and C2 we can 
further bound (pT2~Tj) by 

C Le- c * NR ».K (3.22) 
A^ 



As a consequence, we obtain an almost sure bound 

J2 E V{H' N (^)eA,...,H' N (a e )eA) 

RN, k e ( CT 1 ,..,<T f )G 
K N n(R N ,s N ) vi(i?j V ,it)nx < 

<-^L E e"^^. (3.23) 



K N n(R N ,8 N ) 

Introducing new variables DN,k = VNR^.k we rewrite the above sum as 

^Ve- C <^, (3.24) 

VN,k 
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where the summation is over the discrete set y/NKpj n (-vNRn,vN5n)- Since 
NR 2 ^ — > oo, then for arbitrary C > and for large N we can further bound this 
sum by 

9i y e -c 2 y%, k 2 _ (3 . 25) 

VN,k>C 

Interpreting the last sum as a sum of areas of nonintersecting rectangles with one 
side equal e~ C2yN < k and the other we bound it with the integral 

e- C2y2 dy (3.26) 



Since the constant C is arbitrary we get that (|3 . 1 5|) is o(l). This finishes the proof 
of Theorem 13.21 and therefore of Theorem 11.11 □ 



4. Proof of Theorem 11.41 

In order to prove the breakdown of universality in Theorem 1 1.41 we use a strategy 
similar to that used in |BCMN05b| to disprove the local REM conjecture for the 
number partitioning problem and for the Sherrington-Kirkpatrick model when the 
energy scales are too large. We prove that P-a.s. for every bounded Borel set A 

(1) the limit of the first factorial moment exists and equals fJ,(A); 

(2) the second factorial moment E(Pn(A))2 does not converge to (^l(A)) 2 ; 

(3) the third moment is bounded. 

These three facts immediately imply that the sequence of random variables Vn(A) 
does not converge weakly to a Poisson random variable and so the sequence of 
point processes Vn does not converge weakly to a Poisson point process. Part (i) 
of Theorem 11.41 is thus obviously implied by the following 



Theorem 4.1 (Breakdown of Universality for the number partitioning problem). 
Let v(r) = r. For every bounded Borel set A 

limE(-p w (^))i = M(A), P-a.s. (4.1) 

Moreover, i/limsup M j^ > = e < oo then P-a.s. 

(i) limsu P E(7V.4)) 2 = e 2£2lo s 22 ( M (A)) 2 , 

(ii) limsupE^Ar^)^ < oo. 

Similarly, part (ii) of Theorem II .41 is implied by the following 

Theorem 4.2 (Breakdown of Universality for the Sherrington-Kirkpatrick model). 
Let v(r) = r 2 . For every bounded Borel set A 

limE(P JV (i)) 1 = p(A), P-a.s. (4.2) 

Moreover, if lim sup = e < 8i ^ g2 then P-a.s. 

_ MA)) 2 . 



(ii) limsupE(7 : 'jv(-4))3 < oo. 
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Remark 4.3. Condition e < 81 o g2 in not optimal and could be improved. The 
reason for such a choice is that for e < 81 Q g2 the third moment estimate is quite 
simple. 



We will prove in detail Theorem 14.21 but omit the proof of Theorem 14.11 as it is 
very similar and much simpler. 

Proof of Theorem \4.^\ We successively prove the statement on the first, second, 
and third moment. 

1. First moment estimate. 

Since all the random variables H' N (a), a G Sn, are identically distributed then 

E{V N (A))! = \X\F(H' N (a) G A). (4.3) 

We saw in the proof of Theorem 12.11 that for M(N) satisfying logiV = o(2 M ), 
\X\ = 2 M (1 + o(l)) P-a.s. Combined with (f3~TTj) . the definition of a N , and the 
dominated convergence theorem this fact implies that P-a.s. 

E(^(A)) X = 2 M (1 + (1)) J^-L I e - 2 b%/2-a NbNXdx = KA){1 + o{1) y ^ 

Hence (|4.2p is proven. 



A 



2. Second moment estimate. Next assume that limsup M ^ — s. We now want 
to calculate limsupE('P7v(^4))2- For this we rewrite the second factorial moment as 

E(V N (A)) 2 = £ P ( H n(^) G A, H' n (a 2 ) G A) , (4.5) 

where the summation is over all pairs of distinct configurations (a 1 , a 2 ) G X 2 . We 
then split the set into four non-intersecting subsets and calculate the contri- 
butions from these subsets separately (these calculations are similar to those of 
Theorem 



(1) We begin by calculating the contribution from the set S± , where 

S 1 = {(a\a 2 )&S%: \R(a\ a 2 )\ < t n } (4.6) 

and where the sequence tat is chosen in such a way that Nrfj — > and e~ NTN decays 
faster than any polynomial: this can be achieved by choosing e.g. tm — N i/i^ osN - 
First, we get from p. lip for I = 2 



{H> N (o- 1 )eA,H' N (a 2 )eA) = 



1 e 1+i, i2 



(M log 2—1/2 log M) 



1 



22M y/T=i& 



(v^) 2 

A A 



e -(3,B x)b N /2-a N b N {2M 1)^^ (47) 



where 612 = cov^^ct 1 ), H' N (a 2 )) = R 2 12 . If {a\a 2 ) G Si then b u = o(l) and 
we get from ()3. 12|) that uniformly in (a 1 , a 2 ) G Si the second line in Ij4.7jl is just 

( M (A)) 2 (i + o(i)). 
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Next, we represent the set Si as a union of sets V^t-Rjv.fc) with iijv.fc = 1 — 77 € 
if at n [0, ta?]. Applying Theorem 12.41 (i) we get that 

|V 2 (#iV,fc) n X 2 \ = 2y^2 2M e " JVJ ( i? ~^)(l + o(l)). (4.8) 

Therefore, up to a multiplicative term of the form 1 + o(l), the contribution from 
the set to the sum (I4.5|) is 

Y J_ e T ? fe(Mlog2-l/2 1 o g A/) (M(A))2 2 rE 2 2M e -NJ& N , k ) 

£■ — < 2 V ttN 



= 2^ A( M (A)) 2 £ e -^(fl N , (t ) e T¥&(Mlog2-l/2 1 ogA/)_ (4 Q) 

Since &i2(-Rjv,fc) = -Rjy fc > the ^ as ^ sum ^ s monotone in M. As we will see below 
due to this fact it is sufficient to calculate the upper limit of M(V(A))2 for sequences 
of the form M(N) = sN with e G (0, g^). Thus let M — sN, e G (0, g^). We 
obtain 

T^f-(M log 2 - 1/2 log M) = 2eJVi^ )fc log2 + 0{N^ h ). (4.10) 
i + 012 

The choice of rjy guarantees that for Rw,k < Tiv the last term in the rhs of (|4.10p 

2' 



is of order o(l). Moreover, for x < 1 we observe that J{x) = \x 2 + 0(x 4 ). Thus 

NR 2 Nk 

NJ{R N . k ) = — ^ + o(l). (4.11) 
Using (j4T0|l and ([4~TTjl the sum (14.91) becomes 

2^{^{A)) 2 e-i NR »^-^ 2 Hl + o(l)), (4.12) 

where the summation is over Rn^ € K^D [0, rjv]- Introducing new variables y;v,fc = 
VNRN,k we further rewrite (|4. 12|) as 

-V J =e -4(l-^log2) (1 + o(1)) ^ (413) 

VN,k 

It is not difficult to see that for e < 41 * 2 the sum in (|4. 13|) converges to the integral 

<4 ' 14) 

u 

Therefore the contribution from the set is ^ A )) (l + o(l)). 

•y/l-4s log 2 

We can extend this result to the case when M(N) — eN + o(N) with s < 8 ^ 2 . 
Indeed, assume that e n | e as n — * 00. Then using the above calculation for 
M = e n N together with the monotonicity argument we have that for all n > 1 

E(V N (A)) 2 > { ^ )? (4.15) 
VI - 4e„ log 2 

Taking the limit in n we obtain a lower bound. With exactly the same argument 
we prove the corresponding upper bound. 
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(2) Next, we estimate the contribution from the set S 2 , where 

S 2 = {{a 1 , a 2 ) G S% : \R(a l , a 2 )\ > t n and R(a\ a 2 ) satisfies ([2321) }• (4.16) 

Since the set A is bounded an elementary computation yields that for a constant 
C = C(A), uniformly in (a 1 , a 2 ) G S 2 , 



A A 



For fixed N we let Rjj to be the largest value of the overlap satisfying condition 
(|2.37p . Then representing 52 as a union of sets V2(RN,k), where R^,k — 1 — 77 € 
Kjq n [tjv, fijv]i an< ^ usm g Theorem 1 2. 41 (z), we conclude that the contribution from 
the set S 2 is, up to a multiplicative term of the form 1 + o(l), bounded by 



This quantity is monotone in M. As a consequence, to show that it is negligible 
in the limit N — > 00 it suffices to show this fact under the assumption M(N) = eN. 
Thus, letting M = eN and using that J (x) > \x 2 , we can bound the exponent in 
KWi by 

-N (j{R N ,k) - -^^elog2) < ~Nx 2 {l - 4elog2). (4.19) 



1 + 6 



12 

Since 1 — b 2 2 > 1 /N and since the number of terms in (|4. 18)) is at most N, we can 
further bound it, for some positive constant C > 0, by 

CN 2^-e~i NT ^ 1 - 4£l °z 2 \ (4.20) 

Since e < 81 o g2 the contribution from the set iS^ is negligible by definition of tm- 

(3) We now analyze the contribution from the set , where 

S 3 = {{a 1 , a 2 ) G S 2 N : R{a l ,a 2 ) satisfies d2~39)l }. (4.21) 

Let i?^ be the largest overlap value satisfying condition (|2.39|) . To bound the 
contribution from the set S3 we represent it as a union of sets V 2 {RN,k), where 
Rn,U — 1 — 77 runs over the set Kn fl [R^^R 2 ^]. Proceeding as in (2) and using 
Theorem 12.41 (u), we bound it by 



Again, the sum is monotone in M and thus it is enough to bound it for M = eN. 
For Rff,k satisfying condition (|2.39)1 we can bound the exponent in (|4.22j) as follows: 

21,12 -sNlog2- NJ(R N ,k) < 2&12 eN log 2 — Ne log 2 — ci log N 



612 ' I + 61 

I-612 



I + 612 

where C is some positive constant 



eN log 2 - ci log N < -CNe log 2 - ci log AT, (4.23) 
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Since 1 — 6f 2 > 1 /N and since there are at most N terms in the sum (|4.22[) , we 
can bound the latter by 7V c +!- c i exp(— CWelog2). Therefore, P-almost surely the 
contribution from the set is negligible as N — > oo. 

(4) To finish the second moment estimate it remains to treat the set 

5 4 = {(a- 1 , a- 2 ) S S% : R(a\cr 2 ) satisfies (gUP }. (4.24) 

But by Theorem 12.41 (Hi), the set is P-almost surely empty. This finishes the 
proof of assertion (i) of Theorem 14.21 



3. Third moment estimate. To analyze the third factorial moment we use that, 
by formula (|3.5|) and definition (|2.51|) . it can be written as 



., ,. ., 2 3M Vdetfl 



W X I 

^,31 e (3-(r,S- 1 l))(Mlog2-l/21ogM) 



X 

AAA 



-{x,B 1 S)bj v /2~a N b N (x,B 1) ^5! (4 25) 



where B = B(a 1 , a 2 , a 3 ), the covariance matrix of the vector (Hjs[(a 1 ), -£/at(<7 2 ), Hn{o~ 
is the matrix with elements bij = R 2 j, 1 < i,j < 3, and where the summation runs 
over all triplets of overlaps (R12, R23, R31) & K 3 ^. To estimate this sum we rely on 
three auxiliary lemmas whose proofs we skip since they are simple. 

Lemma 4.4. If e < gl ^ g2 then 

limsup max / / / e -^,B-^b%/2-a N b N {S,B-H) d$ < p _ a s u 2g s 
N ^oo (r'^vexj J J 

AAA 

Lemma 4.5. P-almost surely for all configurations a 1 , a 2 , a 3 G X 

3-(l,B- 1 l)<2(R 2 12 + R 2 3 + R 2 1 ). (4 2?) 

Lemma 4.6. For all a 1 , a 2 , a 3 £ Sn 

J i2) (R 1 2,R2Z,R3l)>\(R 2 1 2+R 2 23 +Rll)- (4-28) 

We are now ready to estimate sum (|4.25[) . In the same spirit as for the second 
moment calculation, we will split (|4. 25[) into four parts and show that every each 
of them is bounded. Moreover, by monotonicity argument similar to that used in 
calculation of the second moment we can restrict our attention to the case when 
M = eN. 

(1) We first calculate the contribution to (|4. 25[) coming from the set 

S 1 = {(a 1 1 a 2 ,a 3 )eS 3 N : max{\R 12 \,\R 23 \,\Rn\} < r N ], (4.29) 

where tn — N i/i\ ogN - Using Theorem 12.51 (i) and Lemma l4.5i we obtain that the 
contribution from §1 is at most of order 

(4.30) 



AT3/2 
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where the summation is over R12, R23, R31 € Kn PI [~tn, tat]. Expanding in Taylor 
series we obtain that for I-R12I, I-R23I, I-R31I < t~n 

J {2) {R 1 2,R23,Rzi) = \{R\ 2 + Rl3 + Rli) +0(t 4 n ), (4.31) 
and thus (|4.30[) is bounded by 

e - 2 -N(R 2 2 +R 2 23 +R 2 31 )+2eN(R 2 2 +R 2 23 +R 2 1 )lo g 2 



_L J2 e-*^^ 1 - 4 ^ ^)] <oo. (4.32) 



Rl2,R23,R31 



N 

Rn 6 K N n [ - T N , Tjv ] 

(2) We next calculate the contribution from the set 

S 2 = {(ctW 3 ) e S N :i?i2,i? 23 ,i?2i satisfy (l2~53l 

and maxjl^l, \R 23 \,\R 31 \} >t n }. (4.33) 

Without loss of generality we can assume that I-R12I > tat- Then, using Theorem 
12.51 (i) and Lemma 14.61 the contribution from this set is at most of order 

E p -NJ^(R 12 ,R 23 ,R 31 ) 
2(R 2 12 +R 2 23 +R 2 31 )Mlog2 (A-U\ 
„ NW(R 12 ,R 23 ,R 31 ) ' 1 '* > 

^112,^123 ;^f-31 

where P(x, y,z) — (1 + x + y + z)(l + x — y — z){\ — x + y — z)(\ — x — y + z) 
and where the sum is over the triplets (-R12, -R23, -R31) & K N satisfying (|2.53p and 
I-R12I > tn- Then, from Lemma l4~5l we further get that the sum ()4.34[) is bounded 

by 

A. 7V3/ 2 P(i? 12 ,i? 23 ,i?3i) 

|-Hl2|>TN 

e -iN(l-8e\o g 2)(R 2 12 + R 2 3 + Rl 1 ) 

= lp -^ Ny 2 P(R 12 ,R23,R 3 i) (4 ' 35) 

Since the number of terms in the sum is at most (iV+1) 3 , and since e -i(^Selo g 2)Nr 2 N 
decreases faster than any polynomial it follows that the sum is of order o(l). 

(3) We now turn to the contribution from the set 

S 3 = {(aVV 3 ) e S% : Ri 2 ,R 23 ,R 31 satisfy ^M} ■ (4-36) 
By Lemma 12.51 (ii) the contribution from this set is at most of order 

^ CE l^JV,3l p (3-(f,B-iD)(Mlog2-l/21o S M) U 37) 

„ R 2 3M v / det~B 

Rl2 ,K23,«31 

where the summation is over the triplets (R12, R23, R31) € Kn satisfying (|2.55|) . 

2 3M e -Af J< 2 > (R 12 ,i?. 23 ,R 3 i) 



Since E|Wjy 3 | is of order 



N^P(R 12 ,R 23> R 31 ) 



(4.38) 
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iV c e-3 JV («?2+-R23+-Rli) 

^ 7V 3 / 2 P(i?i 2 ,i?23,i?3l)' 



we obtain, using Lemmas 14.51 and 14. 6\ that the sum (|4.37|) is bounded by 

a 2eNlog2(R 2 12 +R 2 23 +Rl 1 )^ (439) 

V ~ / I / 1 1 ) /!■>'' M M I 

^12,^23^-3 

It is easy to show that the triplet (-R12, -R23, -R31) that satisfy (|2.55[1 must satisfy 
either |i?23| > tat or |i?3i| > tat. Therefore we can further bound the contribution 
from the set £3 by the sum 

V - IL^NXo^iR^+Rls+Rl,) f440 N 

|i?23|>r« Or \R 3 l\>r N V ' d ' 3 ' 

which is o(l) by the same argument as in part (2). 

(4) To finish the estimate of the third factorial moment we have to estimate the 
contribution to (|4.25|) coming from the set 

Si = {{a 1 , a 2 , a 3 ) e S 3 N : ^12,^23,^21 satisfy fZgfi}. (4.41) 

By Theorem 12.51 (Hi) the set is P-a.s. empty and therefore its contribution is 
P-a.s. zero. This finishes the proof of assertion (ii) of Theorem 14.21 The proof of 
Theorem 14.21 is now complete. □ 



For comparison with the cases v(r) — r p for p = 1 and p = 2, we give here the 
asymptotic behavior of the first three factorial moments for the case v(Q) — v'(Q) = 
0, i.e for instance for the case v(r) = r p of pure p-spins when p > 3. This behavior 
is compatible with a Poisson convergence theorem. 

Theorem 4.7. Assume that ^(0) = z/(0) = 0. For every bounded Borel set A 

limE(V N (A))i = n(A), P-a.s. (4.42) 

Moreover, if lim sup < 81 * g2 then P-a.s. 

(i) limE{V N (A)) 2 = (^(A)) 2 ; 

(ii) hmE(7V(A)) 3 = (p(A)) 3 < 00. 

We do not include a proof of this last statement, which again follows the same 
strategy as the proof of Theorem 14.21 



5. Universality for Non-Gaussian Hamiltonians 

In this section we extend the results of the previous sections to the case of non- 
Gaussian Hamiltonians. We are able to make this extension only for the pure p-spin 
models, i.e. v(r) = r p . In this case we recall that the Hamiltonian is defined as 

l<ii....,i p <N 

Our assumptions on the random variables (gi ll ... J u,)i<i lt ... t i p <N in (|5 . 1 1) are the 
same as were made in [BCMN05a and BCMN05b for the number partitioning 
problem. That is, we assume that their distribution function admits a density p(x) 
that satisfies the following conditions: 

(1) p(x) is even; 

(2) / x 2 p(x)dx = 1; 
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(3) for some e > 

oo 

J p{x) 1+e dx < oo; (5.2) 

— oo 

(4) p(x) has a Fourier transform that is analytic in some neighborhood of zero. 
We write 

- logp(z) = 1 -(2-k) 2 z 2 + c 4 (27r) 4 z 4 + 0(|z| 6 ). (5.3) 

Note that the inequality E(X 4 ) > E(X 2 ) 2 implies that necessarily C4 < A. 
Under these assumptions we will show, using the method introduced by C. Borgs, 
J. Chayes, S. Mertens and C. Nair in |BCMN05bj . that Theorems O and O still 
hold. 

5.1. Proof of Universality. In this subsection we fix p > 1 and prove the analog 
of Theorem 1 1.1 1 in the non-Gaussian case assuming that the Hamiltonian is given by 
(|5.1[) . and that the random variables (5i 1 ,...,i p )i<i 1 ,...,i ! ,<Ar satisfy conditions (1) — (4) 
above. 

Theorem 5.1 (Universality in the Non-Gaussian case). Assume M{N) — o(vJV) 
for p — 1 and M = o(N) for p > 2. Then V-almost surely the sequence of point 
processes Vn converges weakly to a Poisson point process V on 1 with intensity 
given by 

p(dt) = -Le-tV^^dt. (5.4) 

To prove Theorem l5.1l we essentially prove a local limit theorem. More precisely, 
for any fixed N let us introduce the Gaussian process Zn on Sn that has the same 
mean and covariance matrix as the process H' N (o~) defined in (|1.2|) . We will prove in 
Theorem 15 . 2 1 that P-a.s., for all sequences (a 1 , . . . , a 1 ) £ X e , the joint density of the 
random variables H' N {a 1 ) 1 . . . , H' N (o~ e ) is well approximated by the joint density of 
Z N {a x ), . . .,Z N (o- 1 ). 

Theorem 5.2. Assume M(N) = o(y/N) for p = 1 and M = o(N) for p>2. Then 
P -almost surely for every £ > 1 and every bounded Borel set A there exists c > 
such that uniformly in (a 1 , . . . , a e ) E X 1 

HH'n^) e A, j = !,...,£) = F(Z N (o-i) e A, j = 1, . . . ,£) 

x (l + O( J R max (a 1 ,...,a0) + o(^)) +0(e~ cNP ). (5.5) 

Applying Theorem l5. 21 together with Theorem ll.ll we get from formula (|3 . 5|) that 

E(V N (A)) e = (n(A)Y(l + o(l)) + 0(2 Me e~ cNP ) - (p(A)) e , (5.6) 

which, by Lemma l3.ll implies weak convergence of the sequence of point processes 
Vn to a Poisson point process with intensity measure p, thus implying Theorem 
15.11 We therefore focus on the proof of Theorem 15.21 

Proof. First, we obtain from the definition of H' N (a) that 

\H' N (o-) S (x,x + Ax)j =| 9iu-,ip a h ■■■ (J ip e 

l<ii,...,i p <./V 

(a N + xb N , a N + (x + Ax)b N )^\. (5.7) 
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Following BCMN05b we get an integral representation of the indicator function 



Iff' (<r)E{x,x+Ax) 



Ax 



where, for brevity, we wrote n = N p ,ctN — a n +bN(x+ Ax/ 2), sinc(x) = sm ^ x ' > , and 
where the sum in the exponent runs over all possible sequences 1 < i\, . . . ,i p < N. 

Changing the integration variable in (|5.8|) from / to — / and applying the re- 
sulting formula to the product of indicator functions we arrive at the following 
representation 

l i 

g 

]J sine ( fjAxj b N V77) e~ 2m/j I>i - <p< • + ™A , (5.9) 

where a^j' = a^v + b^(xj + Axj/2). Introducing the variables 

e 

<■■ , E-^ (5.10) 

we rewrite the integral in the above formula as 

1 

J] e -2" 9il ...« P "H.-,% J^sinc^A^^e 2 ^ "^™^. (5.11) 

l<h,...,i p <N j = l 

To get an integral representation of the joint density 

P(^((7 1 ) e (x u xi + dx 1 ),...,H' N (a i ) G (a:*, X£ + dx e )) (5.12) 

we have to take the expectation of (|5.9|) and then let Arc.,- — » for all j = 1,2, . . .£. 
As was proved in [BCMN05a] (see Lemma 3.4), the exchange of expectation and 
integration for p = 1 is justified when the rank of the matrix formed by the row 
vectors a 1 , . . . , cr, is I. To justify the exchange in our case we introduce an £ by 
iV p matrix, C p (a 1 , . . . , er £ ), defined as follows: for any given set of configurations 
a 1 , . . . , cr e , the j-th row is composed of all N p products, ej 2 . . . cr? over all subsets 
1 < i\, . . . , i p < N. By generalizing the arguments from [BCMNOSa] the exchange 
can then be justified provided that the rank of the matrix C p (a 1 , . . . , cr ) is £. As 
we will see in Lemma 15.31 below this holds true P-almost surely when M = o(N) . 

Given a vector 8 € { — 1, 1} let ng be the number of times the column vector S 
appears in the matrix C p : 

n s =n s {a l ,...,(T l )= \{j < N p : (cr), . . . , oj) = S}\ . (5.13) 

With this notation we have: 

Lemma 5.3. Suppose M = o(N). Then there exists a sequence Xn = o(l) such 
that P-almost surely for all collections (er 1 , . . . , cr ) G X e 

n 

ns 



max 
<5e{-i,i}' 



2 f 



<n\ N . (5.14) 
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Proof. We first prove by induction that the following simple fact holds true: if 
for a given sequence of configurations (er 1 , . . . , a ) <G S N the matrix Ci(a 1 , . . . , a e ) 
satisfies condition (|5.14p then necessarily the matrix C p {a , . . . , a e ) satisfies (15.14)) 
for all p > 1 . 

For p — 1 there is nothing to prove. We now assume that the statement is true 
for the matrix C p _i(ct 1 , . . . , cr) and prove it for C p {a 1 , . . . , cr). 

Let cr M , 1 < /i < N denote the columns of the matrix Ci(a 1 , . . . , a 1 ). For every 
column vector let us construct a matrix C p 1 _ 1 = Cp_ 1 (a 1 , . . . , cr) with entries 

(C£-i)« = Mi (C P -i)ij. (5.15) 

For future convenience let rig denote the variable rig for the matrix C p 1 _ 1 . ^From 
the inductive assumption it follows that for all 1 < fi < N 

< N^Xn. (5.16) 

Now note that the £ x N p matrix C p (a 1 , . . . , a e ) can be obtained by concatenating 
TV matrices Cp_ 1 (cr 1 , . . . ,o- £ ) each of size £ x iV p_1 . Therefore for any sequence of 
configurations (cr 1 , . . . , a e ) with matrix Ci(a 1 , . . . , a 1 ) satisfying (|5.14j) we have 



max 
s 



2 e 



n s 



NP 



< 



NP- 



< NP-^n + ■■■ + NP~ l X N 



NP- 1 
N p X N 



< 



(5.17) 

and the induction is complete. 

To prove Lemma l5.3l it is thus enough to demonstrate that P-almost surely there 
are no sequences (er 1 , . . . , a 1 ) £ X e such that Ci(cr 1 , . . . , o- e ) violates condition 
(|5.14p . Let us prove this by induction in I. 

For £ = 1 let us introduce the sets 



T N = { a £ S 



N 




Tlx 


< NX N ^ 


2 





Then by Chcrnoff bound 



ATI 



i>N\ N 



N 



< 2 JV+1 e _ ^ Ar((1+Alv)log(1+Alv) ~ Ajv) 



(5.18) 



(5.19) 



Let us choose Xn = o(l) in such a way that M — o{NX 2 N ) and log TV = o(NX 2 N ) 
Then using (|2.12p we obtain that 

l ff = 0, P-a.s. 



(5.20) 



<rer= 



which proves the statement for i = 1. Now assume that P-a.s. for all sequences 
(er 1 , . . . ,cr i ~ 1 ) £ Xt 1 -^ the matrix C^cr 1 , . . . ,o- l ~ x ) satisfies condition (|5. 14|) . Since 
there is only a countable number of sequences (cr 1 , . . . , o l ~ v ) we fix (cr 1 , . . . , a 1 " 1 ) £ 
Y^- 1 ) anc i prove that P-almost surely there are no configurations a 1 such that 
Ci(cr\...,</) violates ([SH]) . 

Let 6 £ {-1, l} f be given. Define Si(8) £ {-1, l}*" 1 as (<5i) l = (S) z for 1 < i < 
I — 1 and also define c>2(e>) € { — 1, 1} as S 2 = (S)e. Let us also introduce, for given 
ch £ {-l,l} e -\ the set 

N Sl = {j<N:(a 1 j ,...,a^ 1 )=S 1 }. (5.21) 
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By the inductive assumption we conclude that for all S± € {— 1, l} e 1 

N 



\N Sl \ 



2 t-i 



< NX 



N ■ 



(5.22) 



From the definition of Ng it is not hard to see that for all d e {—1,1}^ 

n s = \{j eN SliS) :a i j =S 2 (8)}\. (5.23) 



Using the above relation together with (|2. 1 If) and the assumptions on Ajv we get 
that P-almost surely 



ns 



\N Sl 



(«)l 



Therefore from ([532)1 and ([5T24| 



ns 



N 



< 



\N, 



Si(S)\ 



\N. 



S 1 (S) 



< 



N 

¥ 



NX 



N 



< 



NX 



N 



NX 



N 



The induction is now complete and the lemma is proved. 



(5.24) 

NX N . (5.25) 
□ 



Lemma 15.31 implies that P-almost surely, for all (a 1 , . 



"minCc 1 , . . . ,<r e ) = min n s = -j(l + 0(X N )), 



Se{-i,i} 



(5.26) 



and hence, for sufficiently large N, the rank of the matrix C p (a 1 , . . . , a 1 ) is £. The 
exchange of integration and expectation is thus justified. 

Using once again Lemma 15.31 condition (|5.2p . and the dominated convergence 
theorem we obtain that the joint density is 



(iJ^(cr J ) € (xj,xj + dxj) for j = 1, ... ,t) — \\ bN\/ndxj 

i=i 



l<ii 



where we redefined = ajv + b^Xj . We remark for future use that 
for all 1 < j < i. 

It is straightforward at this point to adapt the saddle point analysis used in 
BCMN05b to calculate the integrals of such type. The only difference is that 
instead of the matrix C\{a x , . . . with rows formed by row vectors a 1 , . . . , a e , 
we use the matrix C p (a 1 , . . . , a e ). By analogy with Lemma 5.3 from BCMN05b 
we first approximate the integral in (|5.27p by an integral over a bounded domain, 
i.e. for some ci > depending on > 



,-,i P <N j 1 



M _ 

N ~ 



(5.27) 



0(a N ) 



3=1 



II P(vn,..., lp )Y[e 2 ^ a -^df 3 +0(e- c ^). (5.28) 

i,—,ip J =1 
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We next rewrite the integral in the r.h.s. of (|5.28|) as 



e 2mt - a J] Kf-ST'Iidfr, (5.29) 

ae{-i,i} < i =1 

<i) (i) 

where a = . . . , / = (/i, . . . , /*>), and where a ■ f = a-i/i H hai/« is 

the standard scalar product. 

Using Lemma l5~3l again we can apply Lemma 5.4 from [BCMN05b] to conclude 
that given fj,\ there are constants ci(/ii) > and /i 2 > such that the following 
equality holds whenever rji, . . . , r\g is a sequence of real numbers with J2j < 

and r]ja$ > for all j = 1, . . . ,£ 



JJp(f • S) n * Y[ e 2 ™^< dfc (5.30) 



'-^1 5 j=l 

The values of the shifts r)%, . ■ ■ ,7]£ are determined by the following system: 

0') 

J2^S j F'(iS-r,) = 2m^, j = l,...,£, (5.31) 



where we wrote F = — log p. 

Since max \R(<r, <j')\ is P-almost surely of order o(l) when M = o(N) we can 

apply Lemma 5.5 from BCMN05b and obtain that this system has a unique solu- 
tion 

r,(a) = ±B- 1 a(l + 0(\\a\\l)). (5.32) 
Moreover, for sufficiently small fii, 

Ml t 

e 2nn( t f- a -r,- a ) p(f ■ 8 + IT) ■ S)" 5 ]J dfj 

S j=l 

= e- nG -^ ' (1 + 0(n-!/ 2 ) + 0(a 2 N /n) + 0(i? max )) , (5.33) 

where 

Gn^(a) = V — F(i<5 • rj(a)) + 2707(0:) ■ a. (5.34) 
' n 

Therefore 

f b \ l 1 
P(H' N (a 3 ) G (xj, + dxj) for j = 1, ...,£)= I — ^= J e -«G„,,(oO ^ 

x (1 + C^n" 1 / 2 ) + 0(«&/n) + 0(i? max )) + O^n^e-i 01 "—). (5.35) 
Expanding G n ^i we get the approximation 



iG n , l {a) = ^{oc,B- 1 cx)+o(^-). (5.36) 
z V n J 
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By definition on — 0(>/~M). Under the assumptions of Theorem l5.2l we get that 
a N = o(-\fN) for p = 1, that a^r = o(y/~N) for p > 2, and thus that a^, = o(n). 
It implies that asymptotically the joint density (|5.12[) is Gaussian. More pre- 
cisely, it follows from the equations (|5.35[) and (|5.36[) that P-a.s., for all collections 

(a\...y)ex*, 

l 4 

J] d Xj (l + 0(JU) + °(^f )) + 0{b%n l ' 2 e-^ n ^). (5.37) 



By Lemma T5.3l the term 0(b e N n e ^ 2 e 2 c i"-"n) is of order o(e cAfP ) as — > oo. This 
finishes the proof of Theorem 15.21 □ 



5.2. Breakdown of Universality. In this subsection we follow the same strategy 
as we used in the proof of Theorem 1 1.41 - we fix a bounded set A and study the first 
three factorial moments of the random variable Vn(A). In the case of the number 
partitioning problem the following theorem implies that the Poisson convergence 
fails as soon as limsupM/viV > 0. 

Theorem 5.4 (Number partitioning problem). Fix p — 1 and let the Hamiltonian 
be given by (|5.1|) . For every bounded Borel set A we have 

\im^{V N {A))i= li{A)e- ic ^ Xo ^ 2 , P-a.s. (5.38) 

Moreover, if lim sup M ^ = £ < oo then P-a.s. 

(*) liniSU P E('P A r(yl)) 2 = e 2e 2 log 2-32c 4£ 2 log 2 2(^)2. 

(ii) limsupE(Pjv(^4))3 < oo. 

Therefore the limit of the ratio of the second factorial moment to the square of 
the first is 

E(Pn(A))2 o2 £ 2 log 2 2-24c 4 £ 2 log 2 2 _ 2e 2 log 2 2(l-12c 4 ) 



nv N (A))i 



g 2e- log" 2-24c 4 £- log" 2 _ g 2£- log" mi-±2C4j (5 3g\ 



Taking into account that C4 < we conclude that the ratio is strictly larger than 
one and thus there is no Poisson convergence for e > 0. 

And in the case of the Sherrington-Kirkpatrick model the failure of Poisson 
convergence follows from 

Theorem 5.5 (Sherrington-Kirkpatrick model). Fixp = 2 and let the Hamiltonian 
be given by (|5.ip . For every bounded Borel set A 

limE(V N (A))i =/i(A)e- 4c4e2log22 , P-a.s. (5.40) 

Moreover, if lim sup M ^ = e < sl ^ g2 ^hen P-a.s. 

(i) limsu P E(7MA)) 2 - e-J 2 ^ 2 ^ 

^/l-4e log 2 

(ii) lim sup E(Vn (A)) 3 < 00. 
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The ratio of the second factorial moment to the square of the first moment is 

E(P N (A))l ~* Vl-4£log2- [ °- ' 

For e > the above ratio is strictly larger than one and thus convergence to a 
Poisson point process fails. 

We will give the proof of Theorem 15.51 only since the case p = 1 is based on 
essentially the same computations. 

Proof of Theorem 15.51 As in the proof of Theorem 14.21 we successively prove the 
statement on the first, second, and third moment. To simplify our computations 
we will assume that M = eN (Using the monotonicity argument, the case of general 
sequences M{N) can be analyzed just as in Theorem 14.21 of Section|U). 

1. First moment estimate. 

Following the same steps as in Subsection 15.11 we approximate the density of 
H' N (°) by 

e -«G„,i(a w ) ^L(l + O(-L) + O(^)) + 0{e- cin ), (5.42) 



/2ttV Vat/ v n 

where according to the notation introduced above n = N 2 . 

In the case M = eN we need a more precise approximation of the function G n ^ 
than given by formula (|5.36[) . Expanding the solution of the system f|5.31[) as 

r,( a ) = —B- 1 a + —Y—{S,B- 1 a) 3 B- 1 S + 0{\\a\\ 5 ) (5.43) 
2tt 2tt '-^ n 

s 

and applying (|5.43[) we obtain from (|5.34[) and (|5.3[) that 
n <„\ (2tt) 2 \ ^ n s 2 
s 

+ c 4 (27r) 4 y ^{6 • J?) 4 + 2n{ V ■ a) + 0(\\a\\ 6 ) 

8 11 

= i(a,B- 1 a) + c 4 y — (<5,B- 1 a) 4 + 0(||a|| 6 ). (5.44) 
2 z — ' n 
s 

Using the approximation for G n> i given by formula (|5.44[) . we obtain 



nG n , 1 = ^+c i ^+0 , (5.45) 

where ajv = fljv +6jv^- Since afj/n = 4c4£ 2 log 2 2(l +o(l)), we see that up to an 
e-dependent multiplier the density of H' N (o~) is given by the normal density, more 
precisely, the density of H' N {<r) is 

_L e -^/2 e -4 C4£ 2 iog 2 2( 1 + (_L^ +0 (^)) + 0( e - c ' n ). (5.46) 

Therefore the first factorial moment of Pjy (A) is 

fi(A)e- 4c4£2 log2 2 (1 + o(l)) + o(l) (5.47) 

and (|5.40[) is proven. 



A NEW REM CONJECTURE 



29 



2. Second moment estimate. When analyzing the second moment 

E(V N (A)) 2 = ^(H' N (a l )eA,H' N {a 2 )eA), (5.48) 

(<ji-,a 2 )eX 2 

it is useful to distinguish between "typical" and "atypical" sets of configurations 
(cr\ . . . , </) G X e , a notion introduced in [BCMN05aj . 

Take some sequence On — * such that N9 2 N — * oo and consider the £ x n matrix 
C p (a 1 , . . . , a e ), introduced in Subsection 15. II Then all but a vanishing fraction of 
the configurations (a 1 , . . . , cr) E Sff obey the condition 



max 



n 



< n9 N . (5.49) 



2 f 

When M = o(N), Lemma T5.3I guarantees that for a properly chosen sequences 
dff, P-almost surely, all the sampled sets (a 1 , . . . , a 1 ) S X e obey condition (|5.49D . 
It is no longer the case when M = sN and thus we have to consider the contribution 
from the sets violating (|5.49p . 

Fix p = 2 and 1 = 2. Let a sequence 9 m — ► be given, and define 



a 1 ,<r : 



P^^jeA^jeA), (5.50) 



where the sum runs over all pairs of distinct configurations (er 1 , a 2 ) E X 2 satisfying 
condition ()5.49[) (the so called "typical" configurations). Also define 

n=^P(lf> 1 )eA,ij;(a 2 )e4 (5.51) 

where the sum is over all pairs of distinct configurations (a 1 , a 2 ) E X 2 violating 
(I5.49p (the "atypical" configurations). For later use we introduce the quantities 
I g and II g - the analogs of the variables I and II in the case where the random 
variables {gi 1 ,i 2 )i<ii,i2<N are i.i.d. standard normals. 

Lemma 5.6. Let 9n = ^ ^ 2 N ■ V £ € (0j 4 io g 2 ) then P -almost surely II = o(l) 
and 

-32C4E 2 log 2 2/ (• 4^2 

J = - .^ -(l + (5.52) 
Vl - 4e log 2 

Proof. To prove that 7J is o(l) let us bound the quantity II by 77 ff and show that 
I I a is almost surely negligible for On = 7= 1 = — . We start with the proof of the 
second statement, for which we will need the following simple observation. 

Consider a sequence Xn — ► such that NXn — > 00 and define the set of config- 
urations er G { — 1, 1} W with almost equal number of spins equal to 1 and to — 1 : 



T N = 



[a E S N : \#{cn = 1} - #{0* = -1}| < iVA^v}. (5.53) 

It is not hard to prove that configurations a 1 , a 2 6 7/v with overlap ^(cr 1 , er 2 )| < 
Xn must satisfy (|5.49|) with 9^ = \ 2 N . Therefore the set of pairs (a 1 , a 2 ) E X 2 
violating condition (|5.49[) with 9^ = X 2 ^ is contained in the set 

{V,cr 2 ) E X 2 : |#(ctV 2 )| > X N or a 1 E T£ or a 2 >E T^}. (5.54) 
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Thus to prove that II g is o(l) for 9n = ^ ^ 2 N ^ suffices to prove that 

Y, nn'A* 1 ) e A, H' N {a 2 ) e A) = 0(1), (5.55) 

where the summation is over all pairs of distinct configurations contained in the set 

(ESI With Ajv = - Afi/MosAT 

Let us prove (|5.55[) . Since we already proved in Section 0] that the contribution 
from the set 

{(a 1 , a 2 ) g X 2 : |iZ(<rV 2 )| > X N } (5.56) 

to the sum (|5.48[) is negligible, it is enough to consider the sum (|5.48|) restricted to 
the set 

{(a 1 , a 2 ) G X 2 : a 1 or a 2 G {T^) x and \R(a\a 2 )\ < Ajy}. (5.57) 



By Stirling's formula we obtain that \Tfi\ = -^2 N e~ NJ ( XN \l + 0{\%)) and 
using this fact one can prove, proceeding as in part (i) of Theorem 12. A\ that 

\(U) X \ =E|(T^) X |(1+ (1)) = yX 2 ^e-^ A -)(l + (l)). (5.58) 

Thus for large enough iV the contribution from the set (|5.57[) to the sum (|5.48[) is 
bounded by 

2* [A-2 2M e- N ^ -^re^ M log 2 ~ 1/2 log M) , ( .-,.r,o , 

V ttTV 2 2M v ' 

where the constant C is from ijiHT]) , Using that NJ(\ N ) = \N\ 2 N + O^^^ 
we can further bound (|5 . 59[) by 



2Cf./_H_ e -^A^(l-4 6 log2) (5 60) 

V ttN 

which is o(l) by the choice of A at and e. 

Our next step is to bound the sum II by II g . For this purpose we need to give 
an estimate of the joint density of H' N (a 1 ), H' N (a 2 ) that would be valid also for 
pairs (a 1 , a 2 ) violating condition (|5 .49|) . As we already noted for such [a 1 , a 2 ) the 
results of Subsection 15.11 cannot be applied directly since it is no longer true that 
max \R(<7, a')\ is o(l). Fortunately, we have only 2x2 covariance matrix B(a 1 , a 2 ) 

and using this fact we can easily adapt the results of Subsection 15.11 to the case 
where max \R(a,a')\ is not o(l). We start with formula (|5.27|) which, in the case 

1 = 2, can be rewritten as 
F(H' N (a j ) G (xj,Xj + dxj) for j = 1, 2) 

= b 2 N nd Xl dx 2 fT J] /5K, l2 )e 27r ^( /ia « +/2a «)d/ 1 d/ 2 . (5.61) 
We can rewrite the integral in the above expression as 

P /"OO 

/ / II # f ■ ^ z 2mni a dhdh, (5.62) 
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where S G { — 1, l} 2 - Since the function p is even we obtain 

J]p(f • S) ns = p{h + / 2 )"< 1 - 1 ' + "<- 1 - 1 >/3(/i - / 2 )"( 1 ,- 1 )+«(-i.D. (5.63) 
s 

One obvious relation between H(i,i), n,(_i i), nn-i) and nr-^i) is 

"(l,i) + "(-1 -1) + "(l-i) + = "• ( 5 - 64 ) 

The other one we obtain by noting that 

"(i,i) +"(-i -i) - "(-1,1) - "(l-i) = 2^°* = nRf 2 . (5.65) 

i,j 

Therefore 

"(1,1) + "(-1,-1) = |"(1 + Rl2)i 
"(l.-l) + "(-i,i) = I"(l - #12)- 
By Theorem 12.41 we conclude that P-a.s. max \R(a, <j')\ < 1 and therefore, for 

some positive constant c, 

"(1,1) + "(-1,-1) > "(1,-1) + "(-1,1) = cn - ( 5 - 67 ) 
The above inequality allows us to approximate (|5.62[) by 



(5.66) 



e 2nn(if- a -r,- a ) J^p({ 5 + iq ■ 8) n * dfrdf 2 + 0( e - Cin ) . (5.68) 
1 s 
Adapting the proof of Lemma 5.5 from [BCMNOSb we get 

f I 1 ^niit-oc-n-a.) Y[p(f.S + irj- S) ns dfrdh 



-Ml S 
-nG„, a (o) 



«^ (l + oM=) + Of -% (5.69) 
27m V V Jn/ \ n JJ 



Finally, we obtain that 

P(H' N (a j ) e (xj,Xj + dxj) for j = 1,2)= dx x dx 2 b n e- nGn -^ 



x 



VdetB r, ^( 1 



l + 0[^ r )+0[^L))+0{b 2 N ne- c ^). (5.70) 



2tt 

From (I5.44p we see that for some constant C 

nG n . 2 = f (a^a) + J^f^^f + 0{% > \ (a.fl^a) + C, (5.71) 
and thus the joint density of H' N (a 1 ) 1 H' N (a 2 ) is bounded by 



b 2 N ^dctB 
2tt 6 



^ s " a )/ 2 - c (l + o(^=) + O(^)) + 0(6^ne- ci "). (5.72) 



This last bound for the joint density clearly implies that the sum II could be 
bounded by II g plus an error resulting from the second term. But the cumulative 
error coming from the second term is of order 0(2 2M 6 2 v ne _Cl ") which is negligible 
even in the case limsupM/./V > 0. 

To prove the second statement of the lemma we will approximate the sum / by 
I g , which was already calculated in Section 0] We first notice that for sequences 
(a 1 , . . . , a 1 ) satisfying condition (|5.49|) R max (cr , . . . , a 1 ) — 0{9n)- Furthermore, 
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for configurations {a 1 ,..., a 1 ) obeying condition (|5.49|) it is possible to derive from 
(jOij) that 

G n4 (a) = ha, B~ l a) + c 4 £(l + 3(£ - 1))^ + o(^) + o(^9 N ) . (5.73) 

For the details of the derivation we refer to Subsection 5.4 of |BCMN05bj and 
in particular to formula (5.57) in there. Using formula (|5.73[) with £ = 2 and 
substituting it into (|5.35p we obtain that 

7 = / 3e - 32c ^ 2l °s 22 (l + o(^) + (^)). (5.74) 



This finishes the proof of Lemma 15.61 □ 

To conclude the calculation of the second moment we notice that summing / 
and II we get the second factorial moment 

-32c 4 e 2 log 2 2/ .,(A))2 

E(V N (A)) 2 = p= » (1 + (1)). (5.75) 

VI - 4elog2 



Assertion (i) of Theorem 15.51 is thus proven. 

3. Third moment estimate. To deal with the third moment 

E(V N (A)) 3 = PiKia^eA^^ia^eA^'xia^eA), (5.76) 

we use the same strategy as we used to calculate the second moment. In particular, 
we fix I = 3 and split the sum (|5.76[) in two parts: 

/= ^;^)eA^(^)eA^(<i 3 )ei), (5.77) 

(T 1 ,CT 2 ,<T 3 

where the sum runs over all sequences of distinct configurations (er , a 2 , a 3 ) G X 3 
satisfying condition (|5.49|) . and 

11 = J2 PWeA^^eA^^)^), (5.78) 

where the sum is over all sequences of distinct configurations (a 1 , a 2 , a 3 ) € X 3 
violating (|5T49|) . 

By exactly the same argument as in the calculation of the second moment the 
contribution from the "typical" collections, /, is bounded. We therefore concentrate 
on the analysis of the contribution from the "atypical" collections, II. Since we are 
interested only in the estimate of the third moment from above it suffices to bound 
the joint density P(i/j v (<7 : ') 6 (xj, xj + dxj) for j — 1, 2, 3). We start with formula 
(|5.27p which, in the case i = 3, can be rewritten as 

P(i5/^ v (er-') G (xj 7 Xj + dxj) for j — i,2,3) — b 3 N ndx\dx2dx 3 

II /5(«u, l2 , l 3)e 2 ^( /ia " ,+/2a ~ +/3Q ")d/irf/2d/3. (5.79) 

1 ,«2 ,«3 

We can rewrite the integral in the above expression as 

Yl p(f ■ 5) n ° e 2 * mf a dhdf 2 df 3 , (5.80) 
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where S G { — 1, l} 3 - Since the function p is even we obtain 

jjp(f ■ dr° = P (.h + h + h) ni p(fi +h- hr 

s 

*p{h-h + hY 3 p{.h-.f2-h) n \ 

where 

m = n(i,i,i) +n(_ li _ 1) _ 1 ), 

"-2 = + "(-1 -1,1)) 

n 3 = "(1,-1,1) + "(-1,1,-1)) 

By definition of the matrix C^o -1 , er 2 , <r 3 ) we have 



(5.81) 



(5.82) 



ri2 ~ ri3 — ri4 = nR\ 2 , 



n\ — ?i2 — ri3 +124 = nR 2 



23' 



«i — n 2 + m - ?i4 = nR 31 , 
nx +n 2 + n 3 + n 4 = n. 



(5.83) 



Solving the system 



From Theorem 



71 1 = i7l(l+i?f 2 +i? 2 3 +i?| 1 ), 

n 2 = ln(l + R 2 12 -R 2 23 -R 2 31 ), 

71 3 = |7l(l-i?f 2 -i? 2 3+i?2 i); 
_ 7l4-|7l(l-i? 2 2 +i? 2 3 -i?2 i ). 

we obtain that P-almost surely 

limsup max j(R{a 1 , cr 2 )) <e log 2. 



jV- 



(5.84) 



(5.85) 



Since the function J is monotone we obtain from (|5.85[) and from assumption 



e < 



8 log 2 



that P-a.s. 



limsup max \R(a 1 ,a 2 )\ < — . 



(5- 



It implies that min{?ii, 712, 713, 714} > cn for some positive constant c. It allows us 
to approximate the integral in (|5.79p by 

e2*»(«-«-i-«) JJ^(f . <J + tv . S) ns dfidf 2 df 3 + 0(e- cn ). (5.87) 
1 s 
Adapting the proof of Lemma 5.5 from [BCMNOSb we get 



in 



-nG„, 3 (a). 



Vdet^y 2 ) fl + of M +Q /o| 



(27T71) 3 / 2 



(5.. 



Next, after a little algebra, one can derive from (|5.44[) that for some constant C 



nG n j > -(a,B l a) 



C 



(5.89) 



and this estimate is enough to bound the joint density of H' N (a 1 ), H' N (a 2 ), H' N (a 3 ) 
by the joint density of Z]y(a 1 ), Zm{(t 2 ) 1 Zn{<J 3 ). Thus Theorem 15.51 is proved. □ 
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